NSF PAR Search | NSF Public Access Repository

Virtual Gang Scheduling of Parallel Real-Time Tasks

Ali, Waqar; Pellizzoni, Rodolfo; Yun, Heechul (January 2021, 2021 Design, Automation & Test in Europe Conference & Exhibition (DATE))

Full Text Available
Dynamic Memory Bandwidth Allocation for Real-Time GPU-Based SoC Platforms

https://doi.org/10.1109/TCAD.2020.3012210

Aghilinasab, Homa; Ali, Waqar; Yun, Heechul; Pellizzoni, Rodolfo (November 2020, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems)
Full Text Available
RT-Gang: Real-Time Gang Scheduling Framework for Safety-Critical Systems

https://doi.org/10.1109/RTAS.2019.00020

Ali, Waqar; Yun, Heechul (April 2019, 2019 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS))

Full Text Available
RT-Gang: Real-Time Gang Scheduling Framework for Safety-Critical Systems

Ali, Waqar; Yun, Heechul. (January 2019, Proceedings - IEEE Real-Time and Embedded Technology and Applications Symposium)

In this paper, we present RT-Gang: a novel real-time gang scheduling framework that enforces a one-gang-at-a-time policy. We find that, in a multicore platform, co-scheduling multiple parallel real-time tasks would require highly pessimistic worst-case execution time (WCET) and schedulability analysis, even when there are enough cores, due to contention in shared hardware resources such as the cache and DRAM controller. In RT-Gang, all threads of a parallel real-time task form a real-time gang, and the scheduler globally enforces the one-gang-at-a-time scheduling policy to guarantee tight and accurate task WCET. To minimize under-utilization, we integrate a state-of-the-art memory bandwidth throttling framework to allow safe execution of best-effort tasks. Specifically, any idle cores, if they exist, are used to schedule best-effort tasks, but their maximum memory bandwidth usage is strictly throttled to tightly bound interference with real-time gang tasks. We implement RT-Gang in the Linux kernel and evaluate it on two representative embedded multicore platforms using both synthetic and real-world DNN workloads. The results show that RT-Gang dramatically improves system predictability and the overhead is negligible.
Full Text Available
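
The one-gang-at-a-time policy described in the RT-Gang abstract above can be illustrated with a small user-space sketch. This is not the authors' Linux kernel implementation; the names `Gang`, `CoreSlot`, `schedule`, and `be_budget_mbps` are hypothetical, and the choices that a larger `priority` value wins and that best-effort tasks are throttled only while a gang is running are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Gang:
    """A parallel real-time task whose threads must run together (hypothetical model)."""
    name: str
    priority: int  # assumption: larger value means higher priority
    threads: int   # number of cores the gang occupies at once


@dataclass(frozen=True)
class CoreSlot:
    """What one core runs in a scheduling decision."""
    core: int
    task: Optional[str]              # None means the core stays idle
    bw_budget_mbps: Optional[float]  # None means no memory-bandwidth throttle


def schedule(num_cores: int,
             ready_gangs: List[Gang],
             best_effort: List[str],
             be_budget_mbps: float) -> List[CoreSlot]:
    """Pick at most ONE real-time gang, then fill idle cores with throttled best-effort tasks."""
    # Only gangs that fit on the machine are candidates.
    runnable = [g for g in ready_gangs if g.threads <= num_cores]
    # One-gang-at-a-time: select a single gang even if several would fit together.
    gang = max(runnable, key=lambda g: g.priority, default=None)

    slots: List[CoreSlot] = []
    used = 0
    if gang is not None:
        for core in range(gang.threads):
            slots.append(CoreSlot(core, gang.name, None))
        used = gang.threads

    # Remaining cores run best-effort tasks, throttled while a gang is active.
    pending = iter(best_effort)
    for core in range(used, num_cores):
        task = next(pending, None)
        throttled = task is not None and gang is not None
        slots.append(CoreSlot(core, task, be_budget_mbps if throttled else None))
    return slots


if __name__ == "__main__":
    decision = schedule(
        num_cores=4,
        ready_gangs=[Gang("dnn", priority=2, threads=2),
                     Gang("ctrl", priority=1, threads=2)],
        best_effort=["be1", "be2", "be3"],
        be_budget_mbps=100.0,
    )
    for slot in decision:
        print(slot)
```

In the usage example both gangs would fit on four cores at once, but only the higher-priority one is scheduled; the two leftover cores go to best-effort tasks under the assumed bandwidth budget.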
Protecting Real-Time GPU Kernels on Integrated CPU-GPU SoC Platforms

https://doi.org/10.4230/LIPIcs.ECRTS.2018.19

Ali, Waqar; Yun, Heechul (July 2018, Leibniz international proceedings in informatics)

Integrated CPU-GPU architecture provides excellent acceleration capabilities for data-parallel applications on embedded platforms while meeting the size, weight and power (SWaP) requirements. However, sharing of main memory between CPU applications and GPU kernels can severely affect the execution of GPU kernels and diminish the performance gain provided by the GPU. For example, on the NVIDIA Jetson TX2 platform, an integrated CPU-GPU architecture, we observed that, in the worst case, the GPU kernels can suffer as much as 3X slowdown in the presence of co-running memory-intensive CPU applications. In this paper, we propose a software mechanism, which we call BWLOCK++, to protect the performance of GPU kernels from co-scheduled memory-intensive CPU applications.
Full Text Available
